Interactive Perception : From Scenes to Objects
نویسنده
چکیده
This thesis builds on the observation that robots, like humans, do not have enough experience to handle all situations from the start. Therefore they need tools to cope with new situations, unknown scenes and unknown objects. In particular, this thesis addresses objects. How can a robot realize what objects are if it looks at a scene and has no knowledge about objects? How can it recover from situations where its hypotheses about what it sees are wrong? Even if it has built up experience in form of learned objects, there will be situations where it will be uncertain or mistaken, and will therefore still need the ability to correct errors. Much of our daily lives involves interactions with objects, and the same will be true robots existing among us. Apart from being able to identify individual objects, the robot will therefore need to manipulate them. Throughout the thesis, di erent aspects of how to deal with these questions is addressed. The focus is on the problem of a robot automatically partitioning a scene into its constituting objects. It is assumed that the robot does not know about specific objects, and is therefore considered inexperienced. Instead a method is proposed that generates object hypotheses given visual input, and then enables the robot to recover from erroneous hypotheses. This is done by the robot drawing from a human’s experience, as well as by enabling it to interact with the scene itself and monitoring if the observed changes are in line with its current beliefs about the scene’s structure. Furthermore, the task of object manipulation for unknown objects is explored. This is also used as a motivation why the scene partitioning problem is essential to solve. Finally aspects of monitoring the outcome of a manipulation is investigated by observing the evolution of flexible objects in both static and dynamic scenes. All methods that were developed for this thesis have been tested and evaluated on real robotic platforms. These evaluations show the importance of having a system capable of recovering from errors and that the robot can take advantage of human experience using just simple commands.
منابع مشابه
Consistency effects between objects in scenes.
How does context influence the perception of objects in scenes? Objects appear in a given setting with surrounding objects. Do objects in scenes exert contextual influences on each other? Do these influences interact with background consistency? In three experiments, we investigated the role of object-to-object context on object and scene perception. Objects (Experiments 1 and 3) and background...
متن کاملObject detection in natural scenes: Independent effects of spatial and category-based attention
Humans are remarkably efficient in detecting highly familiar object categories in natural scenes, with evidence suggesting that such object detection can be performed in the (near) absence of attention. Here we systematically explored the influences of both spatial attention and category-based attention on the accuracy of object detection in natural scenes. Manipulating both types of attention ...
متن کاملHow does context influence the perception of objects
and scenes? Although recognition experiments typically investigate objects in isolation, objects in the world rarely appear without some context. Objects are always located spatially within a setting and usually appear with other objects. Does contextual information affect recognition? For example, a new acquaintance may be recognized more quickly in the presence of a mutual friend than alone. ...
متن کاملMoving Objects Tracking Using Statistical Models
Object detection plays an important role in successfulness of a wide range of applications that involve images as input data. In this paper we have presented a new approach for background modeling by nonconsecutive frames differencing. Direction and velocity of moving objects have been extracted in order to get an appropriate sequence of frames to perform frame subtraction. Stationary parts of ...
متن کاملAcquisition Viewpoints - Efficient and Versatile View - Dependent Modeling of Real - World Scenes
Three dimensional modeling is a severe bottleneck for computer graphics applications. Manual modeling is time consuming and, even so, the resulting models fail to capture the true complexity of real world scenes. Automated modeling based on acquiring color and depth data is a promising alternative. We propose an automated modeling approach based on sampling the scene sparsely from a dense set o...
متن کاملInteractive Coding of Visual Spatial Frequency and Auditory Amplitude-Modulation Rate
Spatial frequency is a fundamental visual feature coded in primary visual cortex, relevant for perceiving textures, objects, hierarchical structures, and scenes, as well as for directing attention and eye movements. Temporal amplitude-modulation (AM) rate is a fundamental auditory feature coded in primary auditory cortex, relevant for perceiving auditory objects, scenes, and speech. Spatial fre...
متن کامل